Data Exchange : Getting to the Core ∗ RONALD
نویسندگان
چکیده
Data exchange is the problem of taking data structured under a source schema and creating an instance of a target schema that reflects the source data as accurately as possible. Given a source instance, there may be many solutions to the data exchange problem, that is, many target instances that satisfy the constraints of the data exchange problem. In an earlier paper, we identified a special class of solutions that we call universal. A universal solution has homomorphisms into every possible solution, and hence is a “most general possible” solution. Nonetheless, given a source instance, there may be many universal solutions. This naturally raises the question of whether there is a “best” universal solution, and hence a best solution for data exchange. We answer this question by considering the well-known notion of the core of a structure, a notion that was first studied in graph theory, and has also played a role in conjunctive-query processing. The core of a structure is the smallest substructure that is also a homomorphic image of the structure. All universal solutions have the same core (up to isomorphism); we show that this core is also a universal solution, and hence the smallest universal solution. The uniqueness of the core of a universal solution together with its minimality make the core an ideal solution for data exchange. We investigate the computational complexity of producing the core. Well-known results by Chandra and Merlin imply that, unless P = NP, there is no polynomial-time algorithm that, given a structure as input, returns the core of that structure as output. In contrast, in the context of data exchange, we identify natural and ∗To appear in ACM Transactions on Database Systems. A preliminary version of this paper appeared in Proc. 2003 ACM Symposium of Principles of Database Systems, San Diego, pp. 90– 101. †On leave from UC Santa Cruz; partially supported by NSF Grant IIS-9907419.
منابع مشابه
Tourism Education and Training Models Getting to the Core of Destination Planning and Managemet
متن کامل
The Effect of Exchange Rate, Oil Prices and Global Inflation Shocks on Macroeconomic Variables for the Iranian Economy in the form of a DSGE Model
T he world economy has experienced a bulk of positive and negative shocks in crude oil prices and exchange rates over the years, and that global inflation has undergone some changes. Such shocks have affected the macroeconomic variables in the countries of the world and have challenged the economies of these countries, and have led them to take different measures to protect themselves agai...
متن کاملNetwork Analysis of Interpersonal Relationships in Tehran Stock Exchange
The stock market has an important role in growth and development of countries. Network analysis is one of the latest method in analyzing the stock market. In quantitative science literature, It is a new concept for a macro view to whole market. Therefore, this research analyzes the interpersonal relationships’ network in the Tehran Stock Exchange (TSE). From the type of data collected and analy...
متن کاملCation Exchange Between Piemontite and Garnet in Piemontite-Quartz Schists from Asemi-Gawa Area of Central Shikoku, Sanbagawa Metamorphic Belt, Japan
In the Asemi-gawa area of intermediate high-pressure and low-temperature Sanbagawa metamorphic belt in central Shikoku, Japan, piemontite-quartz schists are common in which cation exchange between piemontite and garnet has been studied. Piemontite in contact with garnet usually contains two zones in which core is enriched in Mn3+ and surrounded by a rim rich in Fe3+. Garnet is Ca-Fe-bearing spe...
متن کاملMPI- and CUDA- implementations of modal finite difference method for P-SV wave propagation modeling
Among different discretization approaches, Finite Difference Method (FDM) is widely used for acoustic and elastic full-wave form modeling. An inevitable deficit of the technique, however, is its sever requirement to computational resources. A promising solution is parallelization, where the problem is broken into several segments, and the calculations are distributed over different processors. ...
متن کامل